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This work was supported in part by research 
grants HL/AL 33259, CA-47541 and CA-47858 from the 
5 National Institutes of Health. The U.S. Government has 
rights in the invention. 



TECHNICAL FIELD 



This invention relates to receptors for 
adhesion peptides, and more specifically to a novel 
10 receptor subunit having affinity for extracellular matrix 
molecules . 

BACKGROUND ART 



Multicellular organisms, such as man, have some 
10^* cells which can be divided into a minimum of fifty 

15 different types, such as blood cells and nerve cells. 

During the course of growth and development, cells adhere 
to other cells, or to extracellular materials, in 
specific and orderly ways. Such cell adhesion mechanisms 
appear to be of importance in mediating patterns of 

20 cellular growth, migration and differentiation, whereby 
cells develop specialized characteristics so as to 
function as, for example, muscle cells or liver cells. 
Cell adhesion mechanisms are also implicated in 
dedifferentiation and invasion, notably where cells lose 

25 their specialized forms and become metastasizing cancer 
cells. 



The mechanisms underlying the interactions of 
cells with one another and with extracellular matrices 
are not fully tinderstood, but it is thought that they are 
30 mediated by cell surface receptors which specifically 

recognize and bind to a cognate ligand on the surface of 
cells or in the extracellular matrix. 
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The adhesion of cells to extracellular matrices 
and their migration on the matrices is mediated in many 
cases by the binding of a cell surface receptor to an 
Arg-Gly-Asp containing sequence in the matrix protein, as 
5 reviewed in Ruoslahti and Pierschbacher, Science 238:491 
(1987). The Arg-Gly-Asp sequence is a cell attachment 
site at least in fibronectin, vitronectin, fibrinogen von 
Willibrand, thrombopondin , osteopontin, and possibly 
various collagens, laminin and tenascin. Despite the 
10 similarity of their cell attachment sites, these proteins 
can be recognized individually by their interactions with 
specific receptors. 



The integrins are a large family of cell 
surface glycoproteins that mediate cell-to-cell and cell- 

15 to-matrix adhesion as described, for example, in the 
Ruoslahti and Pierschbacher article cited above. All 
known members of this family of adhesion receptors are 
heterodimers consisting of an a and a 6 subunit 
noncovalently bound to each other. When the integrin 

20 family was first identified, integrins were grouped into 
three subfamilies based on the three B stibunits that were 
initially recognized (B^, Bg and B^) . Over the past few 
years, the primary stjnictures of three integrin B 
subunits from mammalian cells and one from Drosophila 

25 have been deduced from cDNA. 



Each a subunit was thought to associate 
uniquely with a single B subunit. Eleven distinct a 
subunits have thus far been described. As new integrins 
have been identified, however, it has become clear that 
30 this grouping is not entirely satisfactory, since there 
are clearly more than three B subunits and since some a 
subunits cem associate with more than one B subunit as 
described, for example, in Sonnenberg et al., J. Biol. 
OiaiL^ 265:14030-14038 (1988). 
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Because of the importance of integrins in 
mediating critical aspects of both normal and abnormal 
cell processes, a need exists to identify and 
characterize different integrins. The present invention 
5 satisfies this need and provides related advantages as 
well. 



SUMMARY OF THE INVENTTQK 



The present invention relates to a 
substantially purified B subunit of an integrin cell 
10 surface receptor designated as B^. The amino acid 
sequence of B^ is provided in Figure 3. 

The present invention also relates to amino 
acid fragments specific to B^ that have a variety of uses. 
The invention further relates to vectors having a gene 
15 encoding such fragments. Host cells containing such 

vectors are also provided. The nucleic acids encoding B^ 
as well as nucleic acids that specifically hybridize with 
the nucleic acids encoding B^ sequences are other aspects 
of the present invention. 



20 In a further aspect, the present invention 

relates to a substantially purified integrin comprising B^ 
bound to an a subunit, particularly or a^. Methods of 
blocking the attachment of the B^-containing integrins to 
its ligand and of detecting the binding of such integrins 

25 to its ligand are also provided. 



The present invention also relates to methods 
of increasing or decreasing cell adhesion in cells 
expressing a B^-containing integrin by overexpressing the 
integrin or by binding the integrin with a ligand, such 
30 as vitronectin. 
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TOTKF DESCP TPTIQN OF THE FIGURES 



Figure 1 shows the design of PGR primers. 

Figure 2 shows a map of sequencing strategy. 

Figure 3 shows the nucleotide sequence and 
5 amino acid translation for human (H) and guinea pig (GP) 

Figure 4 shows the alignment of with four 
previously reported integrin 6 svibunits. 

Figure 5 shows the alignment of partial 
10 nucleotide and amino acid sequences from human (H) and 
guinea pig (GP) 6,, 63, B3, and 6^ for the region just 
downstream from the B3F primer. 



DETAILED DESCRIPTION OF THE INVENTION 



15 The present invention provides a composition of 

matter relating to a novel, substantially purified 
integrin B subunit, referred to herein as B^. The amino 
acid sequence of B^ is also provided and is shown in 
Figure 3. 

20 By "substantially purified" is meant 

substantially free of contaminants normally associated 
with a native or natural environment. 

By "B^" is meant a polypeptide having 
substantially the same amino acid sequence and binding 
25 functions of the polypeptides encoded by the sequences 
set forth in Figure 3 for human and guinea pig B^. Thus, 
modified amino acid sequences that do not substantially 
destroy the functions and retain the essential sequence 
of B^ are included within the definition of B^. Amino 
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acid sequences, such as the sequence for B^, and B3, 
having less than 50% homology with the sequence of B^, are 
not substantially the same sequence and, therefore, do 
not fall within the definition of B^. Given the amino 
5 acid sequences set forth herein, additions, deletions or 
substitutions can be made and tested to determine their 
effect on the function of B^. In addition, one skilled in 
the art would recognize that certain amino acids, such as 
the conserved cystines, for example, can be modified to 
10 alter a binding function of fl^* 

Amino acids are identified herein by the 
standard one-letter abbreviations, as follows: 



15 



Amino Acid Symbol 



Alanine A 

Asparagine N 

Aspartic acid d 

Arginine r 

20 Cysteine C 

Glutamine q 

Glutamic acid e 

Glycine G 

Histidine H 

25 Isoleucine I 

Leucine L 

Lysine K 

Methionine M 

Phenylalanine p 

30 Proline p 

Serine S 

Threonine T 

Tryptophan W 

Tyrosine y 

35 Valine V 



Based on its sunino acid sequence, the 6 subunit 
of the present invention is clearly different from B-, 6, 
B3 and other 6 subunits that have recently been 
discovered. For example, the 11-amino acid carboxyl- 
40 terminal extension on B^ distinguishes it from B^, Bg, and 
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Bj. The short cytoplasmic tails of B^, Bg, and B3 are 
thought to be sites of interaction with the cytoskeleton 
and regions for the transduction of signals initiated by 
interactions of the large extracellular domains with 
5 ligands. These cytoplasmic tails may also be targets for 
regulation of integrin function. The distinctive 11- 
amino acid cytoplasmic tail of B^ indicates that its 
regulation or pathways for signal transduction may be 
different from those of B,, &2 ^• 

10 In addition to B,, and B3, recent studies 

have suggested the existence of as many as five other 
integrin B subunits. A 6 subunit with a molecular weight 
of approximately 210,000 (B^) has been foxind associated 
with the integrin a subunit "a^" in colon carcinoma cells 

15 and in a variety of other tumor cells of epithelial 

origin as described, for example, in Kajiji et al., EMBO 
J, , 8:673-680 (1989). On the basis of its high molecular 
weight, 210,000 compared with the predicted size of 
106,000 of the subject novel protein, and on the basis of 

20 its clearly different amino-terminal sequence, it is 
apparent that B4 is not the seune as the subject 
polypeptide. 

Another B subunit, originally called B^ was 
identified in epithelial-derived tumor cells in 

25 association with the integrin a subunit as described, 
for example, in Cheresh et al.. Cell 57:59-69 (1989). 
This B subunit, having a distinctive amino-terminal 
sequence, was recently renamed B5. Based on recent 
studies of purified preparations, 65 clearly differs from 

30 the B subunit of the present invention. Because the B 
subunit described in the present report is distinct from 
each of the five B subunits for which sequence 
information is available, it has been designated as B^. 



The existence of two other integrin B subunits 
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has been inferred from the identification of unique 
proteins after immunoprecipitation of surface-labeled 
cell lysates with antibodies to known a subunits. One of 
these novel proteins, called 65 was found in association 
5 with in the human osteosarcoma cell line H6-63, in the 
fibroblast cell line AF1523, and in human endothelial 
cells as described, for example, in Freed et al., EMBO J, 
8:2955-2965 (1989). This subunit is also different from 
since B3 is expressed in MG-63 cells while is not 
10 expressed in these cells as shown in Table !• 

The other novel integrin B subiinit identified 
by co-immunoprecipitation of known a subunits. Bp, is a 
protein of about 95,000 that is found to be associated 
with a^, an a subunit first found as part of the 
15 lymphocyte homing receptor VIiA-4 as described, for 

example, in Holzmann et al.. Cell 45:37-46 (1989). This 
subunit is also distinct from B^ since Bp is expressed in 
lymphocytes while B^ is not expressed in lymphocytes as 
shown in Table 1. 
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TABLE 1 

Distribution of 





Cell Lines: 


Type 


Results 


Source 


5 


FG-2 


Pancreatic 


+ 


Kajiji et al. 
EMBO J 3:673- 
80 (1989) 




Pane I 


Pancreatic 




Dr. Metzgar, 
Duke U., N.C. 




Colo-396 


Colon CA 


+ 


Dr. L. Walker, 
Cytel , San 
Diego, CA 




UCLA P3 


Lung CA 


+ 


Dr. L. Walker, 
Cytel, San 
Diego, CA 




Hela 


Cervical 




ATCC #CCL-2 


10 


Jar 


Chorio CA 


+ 


ATCC #HTB 36 




HT 1080 


Fibrosarcoma 




ATCC #CCL 121 




U 937 


Monocytoid 




ATCC #CRL 1593 




M 21 


Melanoma 




Dr. R. 
Reisfeld, 
Scripps Clinic 
& Research 
Foundation, La 
Jolla, CA 






fits X ClilwlUCl 




Dr. R. 

Reisfeld 
Scripps Clinic 
& Research 
Foundation, La 
Jolla, CA 


15 


MG 63 


Osteosarcoma 




ATCC #CRL 1427 




Tissues : 


Cervix 

Aortic Endothelium 
Leukocytes 


+ 
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The invention also provides an integrin 
comprising bound to an a subiinit. 6^, consistent with 
recent findings of other 6 subunits, can associate with a 
variety of a sxibunits to form a functional integrin. In 
5 one embodiment, associates with a^. In another 

embodiment, B^ associates with another a subunit referred 
to herein as Op. The B^ integrin, as well as other 
integrins containing B^, can bind molecules, for example 
extracellular matrix molecules. Such molecules are 

10 referred to herein as ligands. In a specific embodiment, 
certain 6^-containing integrins can bind Arg-Gly-Asp- 
containing polypeptides such as vitronectin or 
fibronectin. The binding of 6^-containing integrins to 
various ligands can be determined according to procedures 

15 known in the art and as described for example, in 

Ruoslahti & Pierschbacher, Science 238:491-497 (1987). 

The invention also provides an amino acid 
fragment specific to B^. Since B^ is a novel molecule, it 
contains memy fragments which are specific for this B 

20 subunit- Fragments specific to B^ contain sequences 
having less than 50% homology with sequences of other 
known integrin B subunit fragments. These fragments are 
necessarily of sufficient length to be distinguishable 
from known fragments and, therefore, are "specific for 

25 B^." The amino acid sequence of such fragments can 

readily be determined by referring to the figures which 
identify the B^ amino acid sequences. These fragments 
also retain the binding function of the B^ subunit and can 
therefore be used, for example, as immiinogens to prepare 

30 reagents specific for B^ or as an indicator to detect the 
novel B^-containing integrin of the present invention. 
One skilled in the art would know of other uses for such 
fragments. 

The invention also provides a reagent having 
35 specificity for an amino acid sequence specific for B.. 
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Since is a novel protein with at least 50% amino acid 
differences over related B subunits, one skilled in the 
art could readily make reagents, such as antibodies, 
which are specifically reactive with amino acid sequences 
5 specific for B^ and thereby immunologically distinguish B^ 
from other molecules. Various methods of making such 
antibodies are well established and are described, for 
example, in Antibodies, A Laboratory Manual . E. Harlow 
and D. Lane, Cold Spring Harbor Laboratory 1988, pp. 139- 
10 283 and Huse et al., science 24:1275-1280 (1988). 

The invention also provides nucleic acids which 
encode B^. Examples of such sequences are set forth in 
Figure 3. Following standard methods as described, for 
example, in Maniatis et al.. Molecular Cloning . Cold 

15 Spring Harbor (1982) , nucleic acid sequences can be 
cloned into the appropriate expression vector. The 
vector can then be inserted into a host, which will then 
be capable of expressing recombinant proteins. Thus, the 
invention also relates to vectors containing nucleic 

20 acids encoding such sequences and to hosts containing 
these vectors. 

The sequences set forth in Figure 3 also 
provide nucleic acids that cem be used as probes for 
diagnostic purposes. Such nucleic acids can hybridize 

25 with a nucleic acid having a nucleotide sequence specific 
for B^ but do not hybridize with nucleic acids encoding 
non-B^ proteins, particularly other cell surface 
receptors. These nucleic acids can readily be determined 
from the sequence of B^ and synthesized using a standard 

30 nucleic acid synthesizer. Nucleic acids are also 
provided which specifically hybridize to either the 
coding pr non-coding DNA of B^. 

Integrin cell surface receptors bind ligands, 
such as extracellular matrix molecules. However, the 
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binding of the integrin to the ligand can be blocked by 
various means. For example, the binding of a 6^- 
containing integrin can be blocked by a reagent that 
binds the 6^ subunit or the B^-containing integrin. 
5 Examples of such reagents include, for example, Arg-Gly- 
Asp-containing peptides and polypeptides, ligand 
fragments containing the integrin binding site, as well 
as antibodies specifically reactive with fi^ or a fi^- 
containing integrin. Alternatively, the blocking can be 

10 carried out by binding the ligand or fragment thereof, 
recognized by a B^-containing integrin with a reagent 
specific for the ligand at a site that inhibits the 
ligand from binding with the integrin. Since the binding 
of a B^-containing integrin to its ligand can mediate cell 

15 adhesion to an extracellular matrix molecule, preventing 
this binding can prevent cell adhesion. Alternatively, 
cell adhesion can be promoted by increasing the 
expression of B^-containing integrins by a cell. 

Finally, the invention provides a method of 
20 detecting ligands which bind a B^-containing integrin. 
The method comprises contacting a B^-containing integrin 
with a solution containing ligands suspected of binding 
B^-containing integrins. The presence of ligands which 
bind a B^-containing integrin is then detected, 

25 In summary, the invention claims: 

1. A substantially purified integrin cell 
surface receptor subunit comprising B^. 

2. The substantially purified integrin cell 
surface receptor subunit of claim 1 having the euaino acid 

30 sequence set forth in Figure 3 for himan. 

3. A substantially purified integrin 
comprising B^ bound to an a subunit. 
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4. The integrin of claim 3, wherein the 
siibunit is a^. 



5. The integrin of claim 3, wherein the 
subunit is a^* 

5 6. A substantially purified amino acid 

fragment specific to B^. 

7. A vector comprising a gene encoding for 
the euaino acid fragment of claim 6. 

8. A host containing the vector of claim 7. 

10 9. A reagent having specificity for an amino 

acid sequence specific for fi^. 

10. The reagent of claim 9, wherein the 
reagent is an antibody. 

IX. A substantially purified nucleic acid 
15 encoding 6^. 

12. A substantially purified nucleic acid 
which specifically hybridizes with a nucleotide sequence 
of the nucleic acid of claim 11. 

13. A substantially purified nucleic acid 
20 which specifically hybridizes with the nucleic acid of 

claim 12 and does not hybridize with a nucleic acid 
encoding a non-6^ polypeptide. 
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14. A method of preventing the binding of a 
cell expressing a 6^-containing integrin to ligand capable 
of binding to said B^-containing integrin, comprising 
blocking the binding of the B^-containing integrin and the 
5 ligand. 



15. The method of claim 14, wherein the 
blocking is effected by binding the B^-containing integrin 
with a reagent specific thereto. 

16. The method of claim 14, wherein the 
10 blocking is effected by binding the ligand of the B^- 

containing integrin with a reagent specific for the 
ligand. 



17. The method of claim 15, wherein the 
reagent is an RGD-containing peptide or polypeptide. 

15 18. The method of claim 15, wherein the 

reagent is a ligand fragment containing an integrin 
binding site. 

19. A method of detecting a ligand that binds 
a B^-containing integrin, comprising contacting the 6^- 
20 containing integrin with a solution containing the ligand 
suspected of binding B^-containing integrins and detecting 
the presence of the ligand bound to the B^-containing 
integrin. 



20. A method of increasing cell adhesion in 
25 cells expressing a B^-containing integrin, comprising 
overexpressing the B^-containing integrin in a cell. 



30 



21. A method of decreasing cell adhesion in 
cells expressing a B^-containing integrin comprising 
binding the B^-containing integrin with a ligand. 
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The following examples are intended to 
illustrate but not limit the invention. 

EXAMPLE I 
Identification of a Novel B Subunit 

5 Generation of cDNA Fragments by Polymerase Chain Rea ction 

Tracheal epithelial cells harvested from male 
Heurtley outbred guinea pigs (Charles River Breeding 
laboratories. Bar Harbor, ME) were grown to confluence 
over 10-14 days on collagen-impregnated microporous 

10 filters commercially avail£Q>le from Costar. RNA was 
harvested from these primary cultures, and mRNA was 
purified over oligo (dT) -cellulose columns using the Fast 
Track mRNA isolation kit (Invitrogen, San Diego, 
California) • Two to 5 /ig of mRNA was used as a template 

15 for cDNA synthesis catalyzed by 200 units of Moloney 
murine leukemia virus reverse transcriptase (Bethesda 
Resecurch Laboratories, Gaithersburg, MD) in a 20-40 Ml 
reaction volume. One to 5 fil of the resultant cDNA was 
used as a template for polymerase chain reaction (PCR) . 

20 PCR was ceurried out in a reaction volume of 25-200 fil. 
In addition to the template cDNA, each PCR reaction 
contained 50 mM KCl, 10 mM Tris-HCl (pH 9.0 at 25 'C), 1.5 
mM MgClg, 0.01% gelatin, 0.1% Triton X-100, 0.2 mM each of 
dATP, dGTP, dCTP and dTTP, and 0.05 units/^l Tag DNA 

25 polymerase (obtained from either United States 

Biochemical Corporation, Cleveland, OH, or from Promega, 
Madison, Wl) • 

For each reaction, two oligonucleotide primers 
were also added to obtain a final concentration of 1 mM 
30 each. The primer pairs are identified below. Each 

reaction mixture was overlaid with mineral oil, heated to 
95 'C for 4 min. in a thermal cycler (Ericomp, San Diego, 
CA), and then subjected to 30 cycles of PCR. Each cycle 
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consisted of 45 seconds at 95 'C, 45 seconds at 53 'C, and 
1 min. at 72 'C. Unmediately after the last cycle, the 
sample was maintained at 72 for 10 min. 



The results of each PGR reaction were analyzed 
5 by gel electrophoresis in 1.5% agarose. Reactions that 
produced fragments of the expected size were 
electrophoresed in 1.5% low gel temperature agarose (Bio- 
Rad Laboratories, Richmond, CA) . The appropriate size 
band was excised, melted at 68 *C, and the DNA was 
10 purified by extraction with phenol/chloroform and 
precipitation in ethanol and ammonium acetate. 



PCR Primers 



To obtain the initial fragment of the novel B 
subunit cDNA described herein, degenerate mixtures of PCR 

15 primers were used. Oligonucleotides were synthesized, 

trityl-on, by the University of California, San Francisco 
Biomolecular Resource Center using a DNA synthesizer with 
standard procedures, and purified over Nen-sorb 
cartridges (DuPont-New England Nuclear, Boston, MA) . 

20 These consensus primer mixtures were designed to anneal 
with the nucleotides encoding the highly conserved 
sequence Asp-Leu-Tyr-Tyr-Leu-Met-Asp-Leu (primer BIF) and 
Glu-Gly-Gly-Asp-Ala-Ile-Met-Gln (primer B2R) that flank 
an approximately 300-nucleotide region beginning 

25 approximately 130 amino acids from the sunino terminus of 
each of the integrin fl subunits sequenced to date. The 
sequences of the primers identified herein are depicted 
in Figure 1. 



On the basis of the initial sequence obtained, 
30 a specific forward primer was designed to anneal with the 
sequence encoding the amino acids Pro-Leu-Thr-Asn-Asp- 
Ala-Glu-Arg (primer BTE2F) ending approximately 49 
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nucleotides from the 3" end of the region we had 
sequenced. We also designed an additional forward primer 
(B3F) and two reverse primers (B3R and B4R) to recognize 
highly conserved consensus regions encoding the sequences 
5 Gly-Glu-Cys-Val-Cys-Gly-Gln-Cys (B3 region) and Ile-Gly- 
Leu-Ala-Leu-Leu-Leu-Ile-Trp-Lys (B4 region) . The 
alignment of these primers with previously published 
sequences of human B^, and 63 and chicken is shown in 
Fig. 1. PCR as described 2a)ove was performed with cDNA 
10 from guinea pig tracheal epithelial cells and the primer 
pairs BTE2F/B3R and B3F/B4R. 

The primer pair BTE2F/B3R yielded 1095 
additional base pairs of new sequence. Based on this 
sequence another specific primer (BTE3F) was designed to 
15 recognize the sequence Val-Ser-Glu-Asp-Gly-Val near the 
3» end of this sequence, and PCR was performed with this 
primer in combination with primer B4R. 

Figure 1 shows the design of PGR primers. B 
subunit consensus primer mixtures were designed on the 

20 basis of alignment of published sequences of human fi^, B2, 
Bj and chicken B,. For forward primers (BIF and B3P) , the 
primer sequences included a single nucleotide whenever 
possible for each of the first two nucleotides of each 
codon and were usually either degenerate or included 

25 deoxyinosine for the third base in codons for amino acids 
other than methionine. Reverse primers (B2R, B3R, and 
B4R) were designed in the same manner for the 
complementary DNA strand. Two specific forward primers 
were designed to recognize B^. The first (BTE2F) was 

30 designed to work across species and was thus degenerate 
or included deoxyinosine in the third codon position. 
The second, BTE3F, was not degenerate and was designed to 
only recognize guinea pig B^. 
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Cloning of Fragments Obtained bv PCR 

Individual fragments were cloned in pBluescript 
(Stratagene, San Diego, CA) as follows. Purified 
fragments were resuspended in distilled water containing 
5 deoxynucleotides and treated with 2.5 units of DNA 

polymerase I, large fragment (Promega) to fill in any 3* 
recessed ends left after the last cycle of PCR. The 5» 
ends were phosphorylated with 5 units of T4 
polynucleotide kinase (New England Biolabs, Beverly, MA) . 

10 An aliquot of the above reaction mixture containing 
approximately 100-200 ng of DNA, was ligated into 
pBluescript that had been cut with EcoRV (Promega) and 
dephosphorylated with calf intestinal alkaline 
phosphatase (Boehringer Mannheim, Indianapolis, IN) . 

15 Ligations were performed at 22 'C for 1 hour with T4 DNA 
ligase (Bethesda Research Laboratories) . The ligation 
mixture was used to transform competent Escherichia coli 
(JM109, Clontech, San Francisco, California). Plasmids 
containing inserts were purified using the Pharmacia 

20 miniprep lysis kit (Pharmacia LKB Biotechnology, Inc., 

Piscataway, NJ) denatured in 0.3 M NaOH, further purified 
over spin columns containing Sephacryl S-400 (Pharmacia) , 
and then sequenced using the Sequenase™ version 2.0 
sequencing kit (United States Biochemical Corp., 

25 Cleveland, OH) and [^^S]dATP (Amersham Corp., Arlington 
Heights, IL) . 

Library Screening 

PCR fragments generated with the primer pairs 
B1F/B2R and BTE3F/B4R were uniformly labeled with alpha- 
30 [^^P]dCTP and used as probes to screen a random-primed 
cDNA library and an oligo-dT-primed cDNA library both 
constructed in the plasmid pTZ18R-BstXI (Invitrogen) from 
mRNA obtained from the human pancreatic carcinoma cell 
line FG-2. Plasmid was purified from clones found to 
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hybridize with either region, and inserts were sequenced. 
A portion of insert DNA from one clone was in turn 
labeled and used to screen the same libraries. Fourteen 
independent overlapping clones were sequenced from both 
5 ends using primers that recognize regions of the pTZ 
polylinker. The regions flanking the 3' end of the 
putative translated region of the new B subunit were 
sequenced in both directions from three clones using 
primers constructed to recognize sequences close to the 

10 3* end. On the basis of the initial sequences thus 
obtained, an additional internal sequence was obtained 
from clones TlO, Til, T12 and T14 (Fig. 2) after 
digestion with specific restriction endonucleases and 
relegation. Three internal fragments thus generated were 

15 subcloned into pBluescript and were also sequenced in 
both directions. Approximately 90% of the new sequence 
reported was obtained from both strands of DNA, and 97% 
was obtained from two or more overlapping clones (Fig. 
2). 

20 Figure 2 shows a map of the sequencing 

strategy. Shown are the loca.tion of clones used to 
obtain the partial cDNA sequence of guinea pig (clones 
IF, 3L, 3N and 3Y, top) and the complete sequence of 
human B^ (clones T1-T19 bottom) . Also shown is the 

25 location of the translated region (Protein) . The 

location of the transmembrane domain is shown by the 
letters TM. Clones shown often represent one of several 
identical clones. Internal sequence of clones with long 
inserts was obtained by restriction endonuclease 

30 digestion and relegation and by ligation of internal 

fragments into pBluescript. Specific restriction sites 
employed are shown (Hind, Hindlll; Hinc, Hindi; Kpn, 
Kpnl; Pst, PstI). The direction and extent of sequencing 
are shown by arrows. 1109 and 1110 are the sites 

35 recognized by oligonucleotide sequencing primers. T18 
and T19 each terminated in a poly (A) tail. The regions 
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recognized by the degenerate PGR primers BIF (Bl) , B2R 
(B2), B3R/F (B3)*, and B4R (B4) and the primers BTE2F 
(BTE2) and BTE3F (BTE3) are noted above the guinea pig 
cDNA map, kb, kilobases. 

5 Nucleotide Sequence of a Novel Guinea Pia Inteartn B 
Subunlt 

PGR using cDNA from guinea pig airway 
epithelial cells and the consensus primer mixtures BIF 
and B2R (Fig. 1) amplified DNA fragments with the 

10 expected size of approximately 350 nucleotides. When the 
fragment DNA was sequenced after cloning into 
pBluescript, recombinant clones each contained inserts 
with one of two distinct sequences. One sequence encoded 
a stretch of 98 amino acids that was 97% identical to the 

15 expected region of human B^ and was therefore presumed to 
be guinea pig B^. The other sequence encoded 98 amino 
acids that were only 53% identical to human fi,, 45% 
identical to human B2,. and 57% identical to human B3 (Fig. 
2 , clone IF) • Both of the guinea pig sequences included 

20 the integrin B subunit consensus sequences Ser-X-Ser- 
Met-X-Asp-Asp-Leu and Gly-Phe-Gly-Ser-Phe-Val, and both 
contained the 2 cysteine residues found in this region in 
all known integrin B subunits. These data suggest that 
one of the two sequences we obtained encoded a new member 

25 of the integrin B subunit family. 

This novel sequence was extended by further PGR 
steps utilizing primers specific for the novel sequence 
(BTE2F, BTE3F) in combination with two additional 
degenerate primers (B3R and B4R, see Figs. 1, 2 and 4). 
30 With the primer pair BTE2P/B3R two different cDNA 

products were obtained (3Ii and 3N in Fig. 2) due to an 
unexpected hybridization of the B3R primer with a site 
220 nucleotides further downstream (B3* in Fig. 2). The 
1732-nucleotide sequence determined from these clones is 
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shown in Fig. 3. 

Figure 3 shows Nucleotide sequence and amino 
acid translation for human (H) and guinea pig (GP) B^. 
The amino acid translation is denoted by the single 
5 letter code beneath the second nucleotide of each codon 
from the translated region of human 6^. For the guinea 
pig sequence, only amino acids that differ from the human 
sequence dre shown. The numbers along the right-hand 
margin denote the nucleotide or amino acid number of the 
10 last entry on each line. The numbering system used 

starts with the first nucleotide or amino acid available 
for each sequence shown. The nine potential sites for N- 
glycosylation in the putative extracellular domain of 
human 6^ are underlined. 

15 Nucleotide Sequence, of Human fi^^ 

Screening of cDNA libraries constructed from 
the human pancreatic carcinoma cell line FG-2 with guinea 
pig CDNA probes IF and 3Y (see Pig. 2) and subsequent 
screening with a probe constructed from a portion of 

20 clone TIO (Fig. 2) produced 14 independent positive 

clones. The two longest clones (T18 and T19) extended to 
the poly (A) tail. A map of these clones, constructed on 
the basis of sequence information and of the mobility of 
inserts cut out of these clones in agarose gels is shown 

25 in Fig. 2. This map predicts an mRNA of approximately 5 
kilobases including at least a 226-nucleotide 
untranslated region at the 5' end and, a 2364-nucleotide 
open reading frame, and a 3 • untranslated region of 
approximately 2.5 kilobases. This molecule has been 

30 termed integrin B^. 



Fig. 3 shows the partial nucleotide and 
complete amino acid sequences for human B^ (excluding most 
of the 3"-iantranslated region) and the alignment of the 
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1732 nucleotides of sequence obtained from PGR of guinea 
pig airway epithelial cell cDNA. Of the 577 amino acids 
deduced from the region sequenced in both species only 36 
residues differ; the amino acid sequences are 94% 
5 identical. Furthermore, of the 1732 nucleotides 
sequenced in both species, 91% are identical. Nine 
potential glycosylation sites present in the putative 
extracellular domain of human are shown by underlining. 
All seven of these sites that lie within the 577 amino 
10 acids obtained for guinea pig are also present in the 
guinea pig protein. If all of the potential 
glycosylation sites are occupied with oligosaccharides 
having an average molecular weight of 2,500, the 
predicted molecular weight of human B^ would be 106,000. 



^5 Comparison of the 788-amino acid sequence 

deduced from the open reading frame to the three 
previously sequenced human B subunits and the myospheroid 
protein of Drosonhiia is shown in Fig. 4. 

Figure 4 shows the alignment of B^ with four 
20 previously reported integfin B subunits. Previously 

published sequences for human B^, human Bg, human B5, the 
myospheroid gene product (Bmyo) of Drosophila , and the 
novel sequence described as (6^) are shown using the 
single letter cuaino acid code. The 56 conserved 
25 cysteines are noted by * and the 120 other invariant 
amino acids by = above each line. The transmembrane 
domain is underlined. The regions used for constructing 
the consensus B subunit primers BIF (1), B2R (B2) , B3F/R 
(B3) , and B4R (B4) are labeled below the alignment in 
30 bold type. The numbers along the right-hand margin 
denote the number of the last amino acid in each line 
beginning from the first amino acid of each putative 
signal sequence. 
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There are 179 amino acid residues that are 
identical in each of the other B subxmits and in 6^ 
including 56 conserved cysteine residues. The overall 
percentage of identical amino acids between 6^ and the 
5 other human 6 subunits is 47% for B3, 42% for 6, and 38% 
for Human is also 39% identical to the Drosophila 

B subunit. Human B^, B^ and B3 and the Drosophila B 
subunit all have cytoplasmic regions consisting of 41 
amino acids (beginning after the putative transmembrane 

10 domain shown by the underline in Fig. 4) . Although B^ 
contains each of the 10 conserved amino acid residues in 
this cytoplasmic region it also contains an 11-amino acid 
extension at the carboxyl terminus. B^ also contains two 
Arg-Gly-Asp sequences , one at amino acids 514-516 and the 

15 other at 594-596. These regions could serve as 

recognition sites for other ligands of the integrin 
family. 

PGR using the primer pair B3F/B4R (see Fig. 1) 
amplified fragments of the expected size of approximately 

20 750 nucleotides. Cloning and sequencing of the fragments 
did not result in any additional clones containing the 
novel B subunit sequence but did result in several clones 
with inserts encoding an amino acid sequence that was 97% 
identical to the corresponding region of human B3 and 

25 several others encoding an amino acid sequence that was 
93% identical to human B, (Fig. 5) . These are presumably 
the guinea pig homologues of 6^ and B3, respectively. The 
nucleotide sequences of guinea pig and humzm B, are 80% 
identical, and those of guinea pig and human B3 are 91% 

30 identical; 

Figure 5 shows the alignment of partial 
nucleotide and amino acid sequences from Human (H) and 
guinea pig (GP) B^, B^, Bj, and B^ for the region just 
downstream from the B3F primer. Amino acid tremslations 
35 denoted by the one-letter code are shown below the second 
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nucleotiide of each codon. For the guinea pig sequences, 
only amino acids that differ from the human sequences are 
shown. The numbers shown along the right-hand margin 
denote the nucleotide number for human 6^. The sequences 
5 for human B| emd 63 are from previously piiblished reports. 

EXAMPLE IT 
Bg Associates with And Subunite 

To determine that the novel B subunit of the 
present invention is associated with an a chain similar 

10 to other known integrins^ antisera against peptides from 
the cytoplasmic domain sequence of fl^ were prepared. The 
following amino acid peptides from the cytoplasmic 
sequence of 6^ were prepared and used to immunize rabbits: 
RGSTSTFKNVTYKHR (residues 763-777) and YKHREKQKVDLSTDC 

15 (residues 774-788) . The antisera were raised in rabbits 
according to standard procedures known in the art. 
Briefly^ peptides were chemically coupled to keyhole 
lympet hemocyanin, and were injected in rabbits in either 
complete (first injection only) or incomplete Freund*s 

20 adjuvant as described, for example, in Antibodies s A 
Laborat:ory ?fanuaj,, E. Harlow and D* Lowe, eds.. Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York 
11724. Antisera were termed 6830 (to peptides 
corresponding to residues 763-777) and 6341 (to peptides 

25 corresponding to residues 774-788) . 

The resulting polyclonal antibodies were used 
to immunoprecipitate detergent lysates from the 
pancreatic carcinoma cell line FG-2 that had been surface 
radioiodinated according to procedures well known in the 
30 art such as described, for example, in Kajiji et al., 
EMBO J 3:673-680 (1989). A complex of two bands was 
precipitated of respectively 150 kilodaltons (Kd) and 97 
Kd in SDS-PAGE txnder non-reducing conditions. Under 
reducing conditions, the two bands migrated as a diffused 
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band, extending from 130 Kd to 116 Kd. These bands were 
specific since pre-immune serum did not precipitate any 
of them and they were not present when the 
immunoprecipitation was carried out in the presence of 
5 the corresponding immunogenic peptide. Fuirthermore, the 
same complex of two bands was precipitated by both the 
6830 and 6841 antibodies, which were raised against 
independent peptides from the cytoplasmic sequence 
deduced from 6^ cDNA clones. 



10 To determine which of the two precipitated 

bands corresponds to B^, a SDS-heat denaturated lysate 
from surface-radioiodinated FG-2 cells was 
immunoprecipitated with the 6841 antibody. Only the 97 
Kd band was detectable (non-reducing conditions) , 

15 identifying it as the 6^ band. Under reducing conditions, 
the apparent molecular weight of this band increased to 
116 Kd suggesting the presence of many intra-chain 
disulfide bonds, which is consistent with the primary 
structure of and of. other integrin 6 chains. 

20 The other band, of 150 Kd or 130 Kd under non- 

reducing or reducing conditions, respectively, is likely 
to be an a subunit since it dissociates after SDS-heat 
denaturation of the lysate, indicating that it is non- 
covalently associated with the B^ polypeptide. 

25 Furthermore, similar to certain other integrin a chains, 
its molecular weight decreases under reducing conditions 
by about 20 Kd (130 Kd versus 150 Kd under non-reducing 
conditions) probably due to a disulfide linked small 
peptide that dissociates upon reduction. 

30 To identify which a chain is associated with 6^, 

the aB^ integrin complex was purified by immuno-af finity 
chromatography on a 6841-protein A sepharose matrix 
according to procedures well known in the art such as 
described, for example, in Kajiji et al., EMBO J 3:673- 
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680 (1989). The eluted material was inununoprecipitated 
with antibodies specific for a^, a^, a^, a^, and a^, 
which are known to be expressed in FG-2 cells. Only the 
anti-tty monoclonal antibody 142.19, obtained from David 
5 Cherish, Ph.D., Scripps Clinic and Research Foundation, 
La Jolla, California, reacted with the purified material, 
which indicates that the is associated with in this 
pancreatic carcinoma cell line. 



To confirm this data, immunodepletion 
10 experiments on surface-radioiodinated FG-2 lysates were 
performed according to methods well known in the art such 
as described in Kajiji et al*, EMBO J 3:673-680 (1989). 
The cell lysate was depleted with the 6841 anti-B^ 
antibody or, in parallel, with a control antiserum, and 
15 then immunoprecipitated with the 142.19 anti-a^ antibody. 
A smaller amount of Oy was present in the 
immunoprecipitation on the B^ depleted lysate and no 97 Kd 
B^ band was visible. Instead, a smaller band of about 90 
Kd was present. It is hypothesized that this smaller 
20 band represents the Bj chain also associated with in 
these cells. In the control lysate depleted with normal 
rabbit serum, all three bands, 150 Kd (a^) , 97 Kd (B^) and 
90 Kd (65) were present after immunoprecipitation with the 
anti-tty 142.19 antibody. 

25 Another immunodepletion was carried out using 

142.19 antibody as the depleting antibody, or in parallel 
a mouse monoclonal as a control antibody. 
Immunoprecipitations of a^-depleted lysate with anti-a^ 
142.19 antibodies did not show the presence of any band, 

30 indicating that all a^-containing integrins had been 
removed. However, the 6841 anti-B^ antibody still 
precipitated a complex of two bands, one corresponding to 
B^, the other with a molecular weight close to that of a^. 
This a chain, however, must differ from since it is 

35 unreactive with anti-a^ monoclonal antibodies and is 
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referred to herein as a^. In the control depleted 
lysates, the 6841 anti-B^ antibody precipitates much 
stronger bands, consistent with the possibility that, in 
FG-2 cells, two integrins exist, a^B^ and OEpB^. 

5 Although the invention has been described with 

reference to the presently preferred embodiment, it 
should be understood that various modifications can be 
made without departing from the spirit of the invention. 
Accordingly, the invention is limited only by the claims. 
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WE CLAIM: 



1. A svibstantially purified integrin cell 
surface receptor subunit comprising B^. 

2. The substantially purified integrin cell 
surface receptor subunit of claim 1 having the amino acid 
sequence set forth in Figure 3 for human. 



3. A substantially purified integrin 
comprising bound to an a subunit. 

4. The integrin of claim 3, wherein the 
subunit is a^. 

5. The integrin of claim 3, wherein the 
subunit is a^. 

6. A substantially purified amino acid 
fragment specific to B^. 



7. A vector comprising a gene encoding for 
the amino acid fragment of claim 6. 



8. A host containing the vector of claim 7. 

9. A reagent having specificity for an amino 
acid sequence specific for B^. 

10. The reagent of claim 9, wherein the 
reagent is an antibody. 



11. 

encoding B^. 



A substantially purified nucleic acid 
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12. A sxibstantially purified nucleic acid 
which specifically hybridizes with a nucleotide sequence 
of the nucleic acid of claim 11. 

13. A substantially purified nucleic acid 
which specifically hybridizes with the nucleic acid of 
claim 12 and does not hybridize with a nucleic acid 
encoding a non-B^ polypeptide. 

14. A method of preventing the binding of a 
cell expressing a B^-containing integrin to llgand capcJ:>le 
of binding to said B^-containing integrin, comprising 
blocking the binding of the B^-containing integrin and the 

5 ligand. 

15. The method of claim 14, wherein the 
blocking is effected by binding the B^-containing integrin 
with a reagent specific thereto. 

16. The method of claim 14, wherein the 
blocking is effected by binding the ligand of the B^- 
containing integrin with a reagent specific for the 
ligand. 

17. The method of claim 15, wherein the 
reagent is an RGD-containing peptide or polypeptide. 

18. The method of claim 15, wherein the 
reagent is a ligand fragment containing an integrin 
binding site. 

19. A method of detecting a ligand that binds 
a B^-containing integrin, comprising contacting the B^- 
containing integrin with a solution containing the ligand 
suspected of binding B^-containing integrins and detecting 

5 the presence of the ligand bound to the B^-containing 
integrin. 
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20 • A method of increasing cell adhesion in 
cells expressing a 6^-containing integrin, comprising 
overexpressing the B^-containing integrin in a cell. 

21. A method of decreasing cell adhesion in 
cells expressing a 6^-containing integrin comprising 
binding the 6^-containing integrin with a ligand. 
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IB6HUMAN TAAACACA6CTTTTCT6CTTTACCT6TCCA66TAGCCTCTGnTTCATT 

P6H AGTGTAAGTAGTATTTAAAATGTTATACnCAAGAAAGAAAGACTnAACG 

P6H CTCGCACAGCAAGAACTGAAAC6AATGGG6ATT6AACTGCTTT6CCTGTTC 

M6IELLCLF 
P6H G6T6CAGAAACCT6T6AAGACT6CCT6CTTATT66ACCTCAGT6TGCCT66 

GAETCEDCLLI6PQCAW 
PSH ACCCCAGCAAACCTTTTAGCTAAAGGATGTCAATTAAACnCATCGAAAAC 

TPANLLAKGCQLNFIEN 
P6H CAGAAAAATAGnCTGACAnGnCAGATTGCACCTCAAAGCTTGATCCTT 

Q K _N _S S DIVQIAPQSLIL 
P6H GA6GACT7^Crarr66ATTT6TATTACCTCAT66ACCTCTCCGCCTCCATG 

EDYPVDLYYLMDLSASM 

P5GUINEA PIG TCC6CCTCCAT6 

P6H ATGTCTAAATTAACCAGCAACTnAGACTGGGCTTCGGATCnTTGTGGAA 
MSKLTSNFRL6FGSFVE 
P6GP AT6TCTAAATTAACTA6CAACTTTA6ACT66GCnCG6CTCTTTT6TA6AA 

P6H TGCAGTAGTAnCCATACTTCTGTnACCTACAniGGATTCAAGCACATT 

CSSIPYFCLPTF6FKHI 
P66P TGCAGTAGTAnCCATATATaGCTTACCTACAniGGATTCAAGCACATT 

P6H AAAATnCTGCTAATATTGACACACCCGAAGGTGGATTTGATGCAAnATG 
KISANIDTPE66FDAIM 
P66P AAAAnTCTGCTAATAnGACAACCCTGAAGGTGGATTCGACGCCAnATG 

P6H CTCCTGGTCTnGTGAGTGATGCTGATTCTCATmGGAATGGACAGCAAA 
LLVFVSDADSHF6MDSK 
P6GP CTCCTAGTCTTCGTGAGTGATGCCGATTCTCATnTGGAATGGACAGCAAA 

PSH AATGAATACTCCATGTCAACTGTCTTGGAATATCCAACAATTGGACAACTC 
NEYSMSTVLEYPTIGQL 
B6GP AATGAATACTCCATGTCAACTGTCATGGAATATCCAACAATTGGACAACTC 

M 

P6H GAACAAGTTCATnATATGAGAAnACGCAAAACTTAnCCTGGAGCTACA 
EQVHLYENYAKLIP6AT 
P6GP GAACAAGnCCACTATATGAGAAnATGCAAAACTTATTCCTGGAGCCACA 
P 

P5H GCTTATGAAGAACTGCGGTCTGAGGTGGAACTGGAAGTAnAGGAGACACT 
AYEELRSEVELEVLGDT 
P6GP GCTTATGAAGAACTGCGGTCTGAGGTGGAGCTGGAAGTATTAGGAGATACA 

FIG. 3A 
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TCAGTCnAATGAAAACTnCTAACnATATCTCAAGniCTTTTCAAAGC 100 
ATATTCAGCGnGGTCnGTAACGCTGAAGGTAAnCATnTTTAATCGGT 202 

nTCTATTTCTAGGAAGGAATGATTCACGTACAAGGTGGCTGTGCCTGGGA 30^ 

FLFL6RNDSRTRWLCL6 26 

T6TGCTCAGGAGAATTTTACTCATCCATCTG6A6TT6GCGAAA6GT6T6AT ^06 

CAGE N F T HPSGVGERCD 60 

CCTGTCTCCCAAGT7TOATACTTAAAAATAAGCCTCTCA6TGTAGGCA6A 508 

PVSQVEILKNKPLSVGR 94 

AAGnGAGACCAGGTGGTGCGCAGACTCTGCAGGTGCATGTCCGCCAGACT 610 

KLRPGGAQTLQVHVRQT 128 

GATGACGACCTCAACACAATAAAGGAGCTG6GCTCCG6CCTTTCCAAAGAG 712 

DDDLNTIKELGSGLSKE 162 

GACGATGACCTCAACACAATCAAAGAGCTGGGCTCCCTGCTnCAAAGGAG 63 

AAACCTGTATCCCCnTTGTGAAAACAACACCAGAAGAAATTGCCAACCCT SU^^^ 
KPVSPFVKTTPEEIANP 196 
AAACCCGTCTCCCCTTTTATGAAAACAACACCAGAGGAAAnGCCAACCCT 165 

TTGCCAnGACAAATGATGCTGAAAGATTCAATGAAATTGTGAAGAATCAG 916 
LPLTNDAERFNEIVKNQ ,230 
CTGCCAnGACAAATGATGCTGAAAGATTCAATGAAAnGTGAAGAAACAG 267 

CAAGCTGCTGTGTGTAAGGAAAAAAnGGCTGGCGGAATGACTCCCTCCAC 1018 
QAAVCKEKIGWR N _D_? L H 264 
CAAGCTGCTGTGTGTAAGGAAAAAAnGGCTGGCGGAtreATTCGCTCCAT 369^^^ 

CTAGCAGGCATCGTCAnCCTAATGACGGGCTCTGTCACTTGGACAGCAAG 1120^^^ 
LAGIVIPND GLCHLDSK 298 
CTGGCAGGCAnGTCATTCCCAACGATGGGCTGTGTCACTTGGACAGCAAG 471 

157 

AnGATAAACTGGTACAAAACAACGTGTTATTGATCTTCGCTGTAACCCAA 1222 

IDKLVQNNVLLIFAVTQ 332 
AnGATAAAGTGGTACAAAACAATGTGnAGTGATCTTTGCTGTAACCCAA 573^^^ 

GTA66TCTACTTCAGAA6GACTCCGGAAACATTCTCCAGCTGATCATCTCA 1324 
VGLLQKDS6NILQLIIS 366 

GTGGGGCTACTTCACAAGGACTCTGGAAACAnCTCCAACTGATCATCTCA 675^^^ 
H 225 

GAAGGACTCAACnGTCATTTACAGCCATCTGTAACAACGGTACCCTCTTC 1426, 
E 6 L N L S F T A I C N N G J L F ,,^400 

GAGGGCCTCA/nXTtrCGTTCTCAGCTGTCTGTAACATrrGOTTCTCTTC 777^^^ 

FIG. 3B 
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B6H CAACACCAAAA6AAAT6CTCTCACAT6AAAGT66GA6ACACAGCTTCCTTC 
QHPKKCSHMKVGDTASF 

B6GP CCACACCAAAAGAAATGCnGCACATGAAAGTGGGAGAAACAGCTTCAnC 
P L E 

m ATAAAGCCTGTGGGGCTGGGGGATGCCCTGGAATTACnGTCAGCCCAGAA 
IKPVGLGDALELLVSPE 

B5GP ATAAA6CCTGT66GGCTGG66GACACCCTGGAAATCCTT6TCAGCCCAGAA 

T I 

P6H CAC6GGAAC6GCTCTTTCCAGTGTGGG6TGTGTGCCT6CCACCCT6GCCAC 
H G N 6 S FQCGVCACHP 6_H 

P66P AATGGGATraCTCCTACCAGTGTGGGGTGTGTGCCTGTAACCCAGGCCAC 
NY N 

B6H AAG6AGGCCCCAGATCATCCCTCCTGCAGCGGAAGGGGTGACTGCTACT6T 
KEAPDHPSCSGRGDCYC 

P66P AAGGA6ACCCCA6ACCATCCCTCGTGCA6CGGAA6G66T6ACT6CTACT6T 

B6H TGCCAGT6TGACAATTTCTCCTGCGTGAGACACAAAGGGCTGCTCTGCGGA 
C Q C D NFS CVRHKGLLCG 
P66P TGCCAGTGTGACAmtntCTGTGTGAGGCACAAAGGGCTGCTCTGTGGA 

P6H GGCGAGTACTGCAACTGCACCACCAGCACGGACTCCTGCGTCTCTGAAGAT 
G E Y C N__C T TSTDSCVSED 
P6GP GGAGAGTACTGCAATimtCACCAGCACAGACACCTGCATCTCCGAAGAC 

PSH ACAAACCCTGGAGCCTCAGGACCAADCTGTGAACGATGTCCTACCTGTGGT 

TNPGASGPTCER CPTC6 
P6GP ACGAACCCT66AGCCTC66GACCCACCTGT6AAC6ATGTCCTACCT6TAGT 

P6H 6GCCAAGCCGGAGAAGAATGTGTGGACAAGTGCAAACTAGCTGGTGCGACC 
6QAGEECVDKCKLAGAT 
P66P 66TCAGCCTGGAGAA6AAT6TGT66ACAAATGCAAACTA6CAGGT6T6ACC 

P V 
P6H CAAGGAGAAAATGAATGTTTAAnACAnCCTAATAACTACAGATAATGAG 
QGENECLITFLITTDNE 
P6GP CAAGGAGAAAATGAATGTCTTATTACATTCCTAATAAGTACA6ATAATGA6 

S 

P6H AACATTCCCATGATCATGTTAGGGGTnCCCTGGCTACTCnCTCATCGGG 
NIPMIML6VSLATLLIG 
PSGP AATATTCCTATGATCATGTTGGGGGnTCACTGGCTA 

P6H GAAGnGCCAAATTTGAAGCAGAACGATCAAAAGCCAAGTGGCAAACGGGA 
EVAKFEAERSKAKWQTG 

B6H AAACACAGGGAAAAACAAAAGGTAGACemCCACAGAnGCTAGAACTAC 
KHREKQKVDLSTDC 

FIG. 3C 
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AGCGTGACT6TGAATATCCCACACTGC6AGA6AA6AAGCAGGCACATTATC 1528, 

SVTVNIPHCERRSRHII H3H 
AATGT6ACTGT6A6TATACCAAACT6T6AGA6AAAAA6CAGGCAT6TTATC S79 

N S N K V 293 

TGCAACTGCGACTGTCAGAAAGAAGTGGAAGTGAACAGCTCCAAATGTCAC 1630 

CNCD CQKEVEV N S S . K C H 458 
TGCAGCTGCGATTGTCAGAAAGAAGTGGAAGTGATOTTTCCAAATGCCAC 981_, 
S 327 
ATGGGGCCTCGCTGTGAGTGTGGCGAGGACATGCTGAGCACAGAnCCTGC 1732 

MGPRCECGEDMLSTDSC 502 
ATGGGCCCTCACTGCGAGTGTGGTGAGGACACGCTGAGCACAGAnCCTGC 1083,^, 
H 361 
GGGCAGTGTATCTGCCACnGTCTCCCTATGGAAACATTTATGGACCTTAT 1834 

GQCICHLSPYGNIYGPY 536 
6GGCA6TGCATCTGCCACTTGTCTCCCTATGGAAACA1TTATGGACCTTAC 1185 

395 

GGTAAC6GC6ACT6TGACT6TGGTGAATGTGTGTGCAGGAGCGGCTGGACT 1936 

6N6DCDC6ECVCRS6WT 570 
GATAAC6GAGACT6T6AATGTG6GGAATGCGT6TGCAGGA6TGGTTGGACC 1287 

D E 429 

GGAGTGCTCTGCAGCGGGCGCGGGGACTGTGTTTGTGGCAAGTGTGTnGC 2038 

G VLCSGRGDCVCGKCVC 604 
G6CACGCTCTGCAGCGG6CGCGGGGACTGCGTCTGTGGCAA6TGTGTCTGC 1389 

T 463 
GACCCCTGTAACTCTAAACGGAGCTGCAnGAGTGCCACCTGTCAGCAGCT 2140 

DPCNSKRSCIECHLSAA 638 
GACCCCTGTAACTCTAAACGGAGCTGCAnGAATGCCACCTGTCTGCAGAT 1491 

S D 497 

ATCAGTGAAGAAGAAGATnCTCAAAGGATGGnCTGTTTCCTGCTCTCTG 2242 

ISEEEDFSKDGSVSCSL 672 
ATCAGCAAAGAAGCAGATTTCTCAAAGGATAGnCTGTTTCCTGCTCCCTG 1593 

K A S 531 

GG6AAAACCATCATTCACAGCATCAATGAAAAAGATTGTCCGAA6CCTCCA 2344 

GKTI IHSINEKDCPKPP 706 
GGAAAAACCATCAnCACAACATCAGTGAAAAAGACTGCCCCAAACCTCCA 1695 

N S 565 
GTTGTCCTACTGTGCATCTGGAAGCTACTGGTGTCATTTCATGATCGTAAA 2446 

VVLLCIWKLLVSFHDRK 740 

1732 

577 

ACCAATCCACTCTACAGAGGATCCACAAGTACTTTTAAAAATGTAACnAT 2548 
TNPLYRGSTSTFKNVTY 774 
nTATGCATAAAAAAAGTCTGnrCACTGATATGAAATGnAATG 2644^^^ 

788 

FIG. 3D 
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